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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/581 , 976B 



DATE: 11/26/2002 
TIME: 09:26:10 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF4\11262002\l581976B.raw 



4 <110> APPLICANT: Dalemans, Wilfried L.J. 

5 Gerard, Catherine Marie Ghislaine 
7 <120> TITLE OF INVENTION: Vaccine 

10 <130> FILE REFERENCE: B45124 

12 <140> CURRENT APPLICATION NUMBER: 09/581, 976B 

13 <141> CURRENT FILING DATE: 2000-06-20 

15 <150> PRIOR APPLICATION NUMBER :' PCT/EP98 /08563 

16 <151> PRIOR FILING DATE: 1998-12-18 

18 <150> PRIOR APPLICATION NUMBER: GB 9727262.9 

19 <151> PRIOR FILING DATE: 1997-12-24 
21 <160> NUMBER OF SEQ ID NOS : 28 

23 <170> SOFTWARE: FastSEQ for Windows Version 3.0 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 220 

27 <212> TYPE: PRT 

28 <213> ORGANISM: Artificial Sequence 

30 <220> FEATURE: 

31 <223> OTHER INFORMATION: Chimaeric protein (prot 

32 influenza B and E7 from Human papilloma vi 

33 16) 

35 <400> SEQUENCE: 1 



ENTERED 



ein D from Haemoplilus 
rus type 



36 


Met 


Asp 


Pro 


Ser 


Ser 


His 


Ser 


Ser 


Asn 


Met 


Ala 


Asn 


Thr 


Gin 


Met 


Lys 


37 


1 








5 










10 










15 




38 


Ser 


Asp 


Lys 


He 


He 


He 


Ala 


His 


Arg 


Gly 


Ala 


Ser 


Gly 


Tyr 


Leu 


Pro 


39 








20 










25 










30 






40 


Glu 


His 


Thr 


Leu 


Glu 


Ser 


Lys 


Ala 


Leu 


Ala 


Phe 


Ala 


Gin 


Gin 


Ala 


Asp 


41 






35 










40 










45 








42 


Tyr 


Leu 


Glu 


Gin 


Asp 


Leu 


Ala 


Met 


Thr 


Lys 


Asp 


Gly 


Arg 


Leu 


Val 


Val 


43 




50 










55 










60 










44 


He 


His 


Asp 


His 


Phe 


Leu 


Asp Gly 


Leu 


Thr 


Asp 


Val 


Ala 


Lys 


Lys 


Phe 


45 


65 










70 










75 










80 


4 6' 


Pro 


His 


Arg 


His 


Arg 


Lys 


Asp 


Gly 


Arg 


Tyr 


Tyr 


Val 


He 


Asp 


Phe 


Thr 


47 










85 










90 










95 




48 


Leu 


Lys 


Glu 


He 


Gin 


Ser 


Leu 


Glu 


Met 


Thr 


Glu 


Asn 


Phe 


Glu 


Thr 


Met 


49 








100 










105 










110 






50 


Ala 


Met 


His 


Gly 


Asp 


Thr 


Pro 


Thr 


Leu 


His 


Glu 


Tyr 


Met 


Leu 


Asp 


Leu 


51 






115 










120 










125 








52 


Gin 


Pro 


Glu 


Thr 


Thr 


Asp 


Leu 


Tyr 


Cys 


Tyr 


Glu 


Gin 


Leu 


Asn 


Asp 


Ser 


53 




130 










135 










140 










54 


Ser 


Glu 


Glu 


Glu 


Asp 


Glu 


He 


Asp 


Gly 


Pro 


Ala 


Gly 


Gin 


Ala 


Glu 


Pro 


55 


145 










150 










155 










160 


56 


Asp 


Arg 


Ala 


His 


Tyr 


Asn 


He 


Val 


Thr 


Phe 


Cys 


Cys 


Lys 


Cys 


Asp 


Ser 


57 










165 










170 










175 
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DATE: 11/26/2002 
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58 

59 

60 

61 

62 

63 

65 

66 

67 

68 

70 

71 

72 

73 

75 

76 

77 

78 

79 

80 

81 

82 

83 

84 

85 

86 

87 

89 

90 

91 

92 

94 

95 

96 

97 

99 

100 

101 

102 

103 

104 

105 

106 

107 

108 

109 

110 

111 



Input Set : A:\seqlist.txt 

Output Set: N:\CRF4\11262002\l581976B.raw 

Thr Leu Arg Leu Cys Val Gin Ser Thr His Val Asp lie Arg Thr Leu 

180 185 190 

Glu Asp Leu Leu Met Gly Thr Leu Gly lie Val Cys Pro lie Cys Ser 

195 200 205 

Gin Lys Pro Thr Ser Gly His His His His His His 
210 215 220 

<210> SEQ ID NO: 2 
<211> LENGTH: 663 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Chimaeric protein (protein D from Haemoplilus 
influenza B and E7 from Human papilloma virus type 
16) 

<400> SEQUENCE: 2 
atggatccaa gcagccattc atcaaatatg gcgaataccc aaatgaaatc agacaaaatc 
attattgctc accgtggtgc tagcggttat ttaccagagc atacgttaga atctaaagca 
cttgcgtttg cacaacaggc tgattattta gagcaagatt tagcaatgac taaggatggt 
cgtttagtgg ttattcacga tcacttttta gatggcttga ctgatgttgc gaaaaaattc 
ccacatcgtc atcgtaaaga tggccgttac tatgtcatcg actttacctt aaaagaaatt 
caaagtttag aaatgacaga aaactttgaa accatggcca tgcatggaga tacacctaca 

tttgcaacca gagacaactg atctctactg ttatgagcaa 
atagatggtc cagctggaca agcagaaccg 
tgttgcaagt gtgactctac 
actttggaag acctgttaat 
ccaactagtg gccaccatca 



ggaggatgaa 
tgtaaccttt 
agacattcgt 



gcttcggttg 
gggcacacta 
ccatcaccat 



ttgcatgaat atatgttaga 
ttaaatgaca gctcagagga 
gacagagccc attacaatat 
tgcgtacaaa gcacacacgt 
ggaattgtgt gccccatctg ttctcagaaa 
taa 

<210> SEQ ID NO: 3 
<211> LENGTH: 822 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Chimaeric protein (protein 
influenza B and E6 from Human papilloma virus 
16) . 

<4 00> SEQUENCE: 3 
atggatccaa gcagccattc atcaaatatg gcgaataccc aaatgaaatc agacaaaatc 
attattgctc accgtggtgc tagcggttat ttaccagagc atacgttaga atctaaagca 
cttgcgtttg cacaacaggc tgattattta gagcaagatt tagcaatgac taaggatggt 
cgtttagtgg ttattcacga tcacttttta gatggcttga ctgatgttgc gaaaaaattc 
ccacatcgtc atcgtaaaga tggccgttac tatgtcatcg actttacctt aaaagaaatt 
caaagtttag aaatgacaga aaactttgaa accatggcca tgtttcagga cccacaggag 
cgacccagaa agttaccaca gttatgcaca gagctgcaaa caactataca tgatataata 
ttagaatgtg tgtactgcaa gcaacagtta ctgcgacgtg aggtatatga ctttgctttt 
cgggatttat gcatagtata tagagatggg aatccatatg ctgtatgtga taaatgttta 
aagttttatt ctaaaattag tgagtataga cattattgtt atagtttgta tggaacaaca 
ttagaacagc aatacaacaa accgttgtgt gatttgttaa ttaggtgtat taactgtcaa 
aagccactgt gtcctgaaga aaagcaaaga catctggaca aaaagcaaag attccataat 



112 ataaggggtc ggtggaccgg tcgatgtatg tcttgttgca gatcatcaag aacacgtaga 



D from Haemoplilus 
type 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
663 
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113 gaaacccagc tgactagtgg ccaccatcac catcaccatt aa 822 

116 <210> SEQ ID NO: 4 

117 <211> LENGTH: 273 

118 <212> TYPE: PRT 

119 <213> ORGANISM: Artificial Sequence 

121 <220> FEATURE: 

122 <223> OTHER INFORMATION: Chimaeric protein (protein D from Haemoplilus 



TOO 

iZo 




influenza \ 


B and E6 


from Human papilloma virus type 






T O A 

1Z4 




16) 






























T o c 
1Z D 


<4 00> SEQUENCE: 


A 
H 
























LZ f 


Met 


Asp 


Pro 


Ser 


C ■>- 

ber 


LJ n ci 

HIS 


C r-N -v 

ber 


ber 


Asn 


Met 


Ala 


Asn 


Thr 


bin 


Met 


Lys 


TOO 

IZo 


T 
1 








o 










10 










15 




1 O Q 


Ser 


Asp 


Lys 


T T ^ 

He 


T I - 

lie 


lie 


Ala 


His 


Arg 


Gly Ala 


Ser 


Gly 


Tyr 


Leu 


Pro 


130 








o n 
Z U 










£ 3 










jU 






1 O 1 


blU 


His 


Thr 


Leu 


blU 


ber 


Lys 


Ala 


Leu 


Ala 


Pne 


Ala 


Gin 


bin 


Ala 


Asp 


132 






jj 










a n 










45 








TOO 

X 5 J 


Tyr 


Leu 


blU 


bin 


Asp 


Leu 


7\ 1 -i 

Ala 


Met 


l nr 


Lys 


Asp 


Gly Arg 


Leu 


X 7 T 

Val 


val 


1 O A 

lo4 




jU 










D D 










60 












T 1 

lie 


His 


Asp 


nlS 


Fne 


Leu 


Asp 


biy 


Leu 


1 nr 


Asp 


Val 


Ala 


Lys 


Lys 


rne 


IOC 

lob 


DO 










/ u 










75 












TOO" 

lo / 


Pro 


HIS 


Arg 


HIS 


Arg 


Lys 


Asp 


biy 


Arg 


Tyr 


Tyr 


Val 


He 


Asp 


rne 


Thr 


TOO 

loo 










O D 










90 














i on 

i jy 


Leu 


Lys 


blU 


lie 


bin 


C /-x V" 

ber 


Leu 


blU 


Met 


Thr 


Glu 


Asn 


Phe 


blU 


i nr 


Met 


t a n 
1 4 U 








100 










ins 










1 1 n 






1 A 1 

1 41 1 


Ala 


Met 


Phe 


Gin 


Asp 


Pro 


bin 


blU 


Arg 


Pro 


Arg 


Lys 


Leu 


Pro 


bin 


Leu 


T /l O 

14 Z 






115 










120 










125 








143 


Cys 


Thr 


Glu 


Leu 


Gin 


Thr 


Thr 


He 


His 


Asp 


lie 


He 


Leu 


Glu 


Cys 


Val 


144 




130 










135 










140 










145 


Tyr 


Cys 


Lys 


Gin 


Gin 


Leu 


Leu 


Arg 


Arg 


Glu 


Val 


Tyr 


Asp 


Phe 


Ala 


Phe 


146 


145 










150 










155 










160 


147 


Arg 


Asp 


Leu 


Cys 


He 


Val 


Tyr 


Arg 


Asp 


Gly 


Asn 


Pro 


Tyr 


Ala 


Val 


Cys 


148 










165 










170 










175 




149 


Asp 


Lys 


Cys 


Leu 


Lys 


Phe 


Tyr 


Ser 


Lys 


He 


Ser 


Glu 


Tyr 


Arg 


His 


Tyr 


150 








180 










185 










190 






151 


Cys 


Tyr 


Ser 


Leu 


Tyr 


Gly 


Thr 


Thr 


Leu 


Glu 


Gin 


Gin 


Tyr 


Asn 


Lys 


Pro 


152 






195 










200 










205 








153 


Leu 


Cys 


Asp 


Leu 


Leu 


He 


Arg 


Cys 


He 


Asn 


Cys 


Gin 


Lys 


Pro 


Leu 


Cys 


154 




210 










215 










220 










155 


Pro 


Glu 


Glu 


Lys 


Gin 


Arg 


His 


Leu 


Asp 


Lys 


Lys 


Gin 


Arg 


Phe 


His 


Asn 


156 


225 










230 










235 










240 


157 


He 


Arg 


Gly 


Arg 


Trp 


Thr 


Gly 


Arg 


Cys 


Met 


Ser 


Cys 


Cys 


Arg 


Ser 


Ser 


158 










245 










250 










255 




159 


Arg 


Thr 


Arg 


Arg 


Glu 


Thr 


Gin 


Leu 


Thr 


Ser 


Gly 


His 


His 


His 


His 


His 


160 








260 










265 










270 






161 


His 

































164 <210> SEQ ID NO: 5 

165 <211> LENGTH: 1116 

166 <212> TYPE: DNA 

167 <213> ORGANISM: Artificial Sequence 
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169 <220> FEATURE: 

170 <223> OTHER INFORMATION: Chimaeric protein (protein D from Haemoplilus 

171 influenza B and E6E7 fusion from Human papilloma 

172 virus type 16) 

174 <400> SEQUENCE: 5 

175 atggatccaa gcagccattc atcaaatatg gcgaataccc aaatgaaatc agacaaaatc 60 

176 attattgctc accgtggtgc tagcggttat ttaccagagc atacgttaga atctaaagca 120 

177 cttgcgtttg cacaacaggc tgattattta gagcaagatt tagcaatgac taaggatggt 180 

178 cgtttagtgg ttattcacga tcacttttta gatggcttga ctgatgttgc gaaaaaattc 240 

179 ccacatcgtc atcgtaaaga tggccgttac tatgtcatcg actttacctt aaaagaaatt 300 

180 caaagtttag aaatgacaga aaactttgaa accatggcca tgtttcagga cccacaggag 360 

181 cgacccagaa agttaccaca gttatgcaca gagctgcaaa caactataca tgatataata 420 

182 ttagaatgtg tgtactgcaa gcaacagtta ctgcgacgtg aggtatatga ctttgctttt 480 

183 cgggatttat gcatagtata tagagatggg aatccatatg ctgtatgtga taaatgttta 540 

184 aagttttatt ctaaaattag tgagtataga cattattgtt atagtttgta tggaacaaca 600 

185 ttagaacagc aatacaacaa accgttgtgt gatttgttaa ttaggtgtat taactgtcaa 660 

186 aagccactgt gtcctgaaga aaagcaaaga catctggaca aaaagcaaag attccataat 720 

187 ataaggggtc ggtggaccgg tcgatgtatg tcttgttgca gatcatcaag aacacgtaga 780 

188 gaaacccagc tgatgcatgg agatacacct acattgcatg aatatatgtt agatttgcaa 840 

189 ccagagacaa ctgatctcta ctgttatgag caattaaatg acagctcaga ggaggaggat 900 

190 gaaatagatg gtccagctgg acaagcagaa ccggacagag cccattacaa tattgtaacc 960 

191 ttttgttgca agtgtgactc tacgcttcgg ttgtgcgtac aaagcacaca cgtagacatt 1020 

192 cgtactttgg aagacctgtt aatgggcaca ctaggaattg tgtgccccat ctgttctcag 1080 

193 aaaccaacta gtggccacca tcaccatcac cattaa 1116 

195 <210> SEQ ID NO: 6 

196 <211> LENGTH: 371 

197 <212> TYPE: PRT 

198 <213> ORGANISM: Artificial Sequence 

200 <220> FEATURE: 

201 <223> OTHER INFORMATION: Chimaeric protein {protein D from Haemoplilus 

202 influenza B and E6E7 fusion from Human papilloma 

203 virus type 16) 

205 <400> SEQUENCE: 6 

206 Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gin Met Lys 

207 1 5 10 15 

208 Ser Asp Lys lie lie lie Ala His Arg Gly Ala Ser Gly Tyr Leu Pro 

209 20 25 30 

210 Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala Asp 

211 35 40 45 

212 Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val 

213 50 55 60 

214 lie His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe 

215 65 70 75 80 

216 Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val lie Asp Phe Thr 

217 85 90 95 

218 Leu Lys Glu He Gin Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met 

219 100 105 110 

220 Ala Met Phe Gin Asp Pro Gin Glu Arg Pro Arg Lys Leu Pro Gin Leu 

221 115 120 125 
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222 
223 
224 
225 
226 
227 
228 
229 
230 
231 
232 
233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 
253 
255 
256 
257 
258 
260 
261 
262 
263 
265 
266 
267 
268 
269 
270 
271 
272 
273 





1 1 n 


VJlU 


Ton 

lie u 


OX 1 1 


Thr 


1 111 


lie 


MIS 


Asp 


He 


1 its 


Leu 


Glu 




val 




1 -J U 










1 O J 










I 4 f) 

I I \j 










iyr 




Lys 




bin 


LGU 


Leu 


Arg 


Arg 


Glu 


Val 


T\/T 

i yr 


Asp 


Phe 


nld 


IT I1C 












1 so 










155 










i fin 


7\ y~rt 




Leu 




Tin 

1 1 e 


V a 1 


iyr 


Arg 


Asp 


Gly Asn 


D v- /^S 

Jri o 


Tyr 


Ala 


Vol 


Pwo 

L,y s 




















1 /U 










1 / O 




Asp 


Lys 


^.ys 


Leu 


Lys 


rne 


Tyr 


ber 


Lys 


Tl - 

lie 


Ser 


pl n 
LjlU 


Tyr 


Arg 


nlS 


iyr 








1 Rn 










1 0 0 










190 






Cys 


Tyr 


Ser 


Leu 


Tyr 


pi 

biy 


i nr 


rp v> v 

i nr 


Leu 


blU 


bin 


bin 


Tyr Asn 


Lys 


Pro 






1 y 0 










^UU 










ZUO 








Leu 


Cys 


Asp 


Leu 


Leu 


Tl fl 

lie 


Arg 


Cys 


T 1 - 

lie 


Asn 


bys 


pi n 
bin 


Lys 


Pro 


Leu 


Cys 




Z 1U 










z i o 










ZzU 










Pro 


Pin 

blU 


Pin 

blU 


Lys 


bin 


Arg 


nlS 


Leu 


Asp 


Lys 


Lys 


plv, 

bin 


Arg 


rne 


nlS 


Asn 


OOt: 
Z Z O 




















o o c 
Zoo 










z f± u 


lie 


Arg 


biy 


Arg 


Trp 


l nr 


pi ,. 

biy 


Arg 


Cys 


Met 


Ser 


Cys 


Cys 


Arg 


O -v 

ber 


O 

ber 










Z *1 O 










250 










ZOO 




Arg 


1 nr 


Arg 


Arg 


pi „ 
blU 


rn"U -w- 

l nr 


pi „ 
bin 


Leu 


Met 


His 


Gly 


Asp 


Thr 


Pro 


i nr 


Leu 








o c, n 










OCR 
Z DO 










270 






nlS 


Pin 


Tyr 


Met 


Leu 


Asp 


Leu 


pi n 
bin 


Pro 


Glu 


Thr 


i nr 


Asp 


Leu 


Tyr 


Cys 






O T C 
Z /O 










o o n 
za U 










285 








Tyr 


p i i -i 
blU 


P 1 n 

bin 


Leu 


Asn 


Asp 


ber 


ber 


blU 


Glu 


Glu 


Asp 


Glu 


He 


Asp 


biy 




290 










z y o 










oUU 










Pro 


Ala 


Gly 


Gin 


Ala 


Glu 


Pro 


Asp 


Arg 


Ala 


His 


Tyr 


Asn 


He 


Val 


Thr 


305 










310 










315 










320 


Phe 


Cys 


Cys 


Lys 


Cys 


Asp 


Ser 


Thr 


Leu 


Arg 


Leu 


Cys 


Val 


Gin 


Ser 


Thr 










325 










330 










335 




His 


Val 


Asp 


He 


Arg 


Thr 


Leu 


Glu 


Asp 


Leu 


Leu 


Met 


Gly 


Thr 


Leu 


Gly 








340 










345 










350 






He 


Val 


Cys 


Pro 


He 


Cys 


Ser 


Gin 


Lys 


Pro 


Thr 


Ser 


Gly 


His 


His 


His 






355 










360 










365 








His 


His 


His 






























370 






























<210> SEQ ID 


NO: 


7 

























<211> LENGTH: 663 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Chimaeric protein (protein D from Haemoplilus 
influenza B and mutated E7 from Human papilloma 
virus type 16) 

<400> SEQUENCE: 7 
atggatccaa gcagccattc atcaaatatg gcgaataccc aaatgaaatc 



attattgetc accgtggtgc tageggttat 

cttgcgtttg cacaacaggc tgattattta 

cgtttagtgg ttattcacga tcacttttta 

ccacatcgtc ategtaaaga tggccgttac 

caaagtttag aaatgacaga aaactttgaa 



agacaaaatc 
atctaaagca 
taaggatggt 



ttaccagagc ataegttaga 
gagcaagatt tagcaatgac 
gatggcttga ctgatgttgc gaaaaaattc 
tatgtcatcg actttacctt aaaagaaatt 
accatggcca tgcatggaga tacacctaca 



ttgcatgaat atatgttaga tttgeaacca gagacaactg atetctaegg ttatcagcaa 
ttaaatgaca gctcagagga ggaggatgaa atagatggtc cagctggaca agcagaaccg 



60 
120 
180 
240 
300 
360 
420 
480 
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